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Abstract 


Background / Objectives: The size of a software project is the main factor for calculating the 
time, effort and cost for a developing software system. The scheduled completion of a software 
project is mainly depending on earlier estimation of project effort, time and cost. 

Methods / Statistical Analysis: The estimation of these software metrics is giving rational 
way of development of software. The traditional software industry is using many sizing 
techniques like Lines of Code (LoC), Function Points (FP), Feature Points (Fe. P), Use Case 
Points (UCP), Object Points (OP), Internet Points (IP), etc. 

Findings: These methods are not giving actual size of modern artificial intelligence and data 
analytics software applications. The over or under estimation of size leads the delay in delivery, 
over cost and even suspension of a project. 

Applications / Improvements: This research article is analysing all the existing size 
estimation techniques related to artificial intelligence and data analytics applications. 


Key words: Function Point Analysis (FPA), Software Size, Artificial Intelligence, 
Data Analytics, Software Size estimation techniques. 


1. Introduction 


One of the important processes of software engineering is software sizing. It is used to find 
the size of a software. The size is one of the major factors for determining cost, effort and 
budget in the software development process. Software project management is one of the fields 
of project management. It is analysing, designing, implementing, testing and maintaining the 
software project. Without an effective planning mechanism, to complete a project with in time, 
budget and effort is a difficult task [1]. The software size is acting as the base factor for 
determining effort, budget, time, schedule, etc., [2]. The software industry is using various 
generic and particularistic sizing techniques like Lines of Code (LoC), Function Point Analysis 
(FPA), Future Points, Object Points, Use Case Points, Internet Points etc. These techniques are 
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programming language dependent and technology oriented [3]. These existing software sizing 
techniques are not opted for finding the size of modern software like Artificial Intelligence and 
Data Analytics. 


Motivation 


The main factor for the planning activities of software development process is software size 
[5-10]. The modern software is an amalgamation of software and other Engineering disciplines. 


It includes Application Programs, Embedded Systems, Networking, Cloud storage, Data 
Mining and Data Warehousing, Big Data, Data Analytics, Machine Learning, Artificial 
Intelligence, Enterprise Resource Planning, Service Oriented Architecture, E-Commerce, 
Machine to Machine Communications, Designs and Animations. The software development 
process is under crisis due to inadequate methodologies for finding software size. 


2. Literature Review 


The beginning eras of software industry considers only lines of codes present in a software 
project. According to Capers Jones [1-5], the lines of code was the best practice for finding the 
size of the software because the homogeneity of the software development process in 1947 to 
1957. The advent of new software like COBOL, FORTRAN (1957 to 1967) increased the 
hands of software development and started to develop many new applications in procedure 
oriented and intermediate programming language domains. This extension is multipurpose and 
complexity of the code is differed for application to application. So that, the existing lines of 
codes is not opt for finding the size of software products. 


Expert judgement is another software sizing method introduced by Helmer of RAND 
corporation in 1950s. The support of an expert is required for finding the size of the software 


[2]. 


The size of scientific software applications is estimated using operands and operators used in 
the application. This concept was proposed by M.H. Halsted in 1969 in his software science 
matric [2]. 


The advent of programming languages increased the code complexity and number of lines of 
code in the software development process. Also increased the effort level of the software 
development process [1]. 


In 1965, a new metric named “Function Points” was introduced by one of the IBM researchers 
Allan Albrecht [1-5]. The function points consider Inputs, Outputs, Inquiries, Logical files and 
Interfaces. This is the first method determines the size and complexity based on functional 
values. 


Mehwish Nasir, H. Farooq Ahmad [6] suggested that an individual sizing method is not opt for 
finding the size of a software project. The early identification of size leads the software 
development process in time, cost and effort effective. 


Mahir Kaya et al., [8] software size is the base factor for finding all other metrics of software 
development process. The size estimation at the early phases will reduce time, cost and effort 
complexities. 
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Daniel V. Feren et al., [10] states, a new technique is required for finding the size of modern 
software projects it must be independent of its environment and must be measured the size at 
the early phases of Software Development Life Cycle (SDLC). 


Z. Zia et al., [12] states that for doing software development analysis, project planning, risk 
analysis and budgeting, the software size is a key factor. 


Md. Forhad Rabbi et al., [13] states that the software industry is matured. The earlier estimation 
using the existing techniques may wrong in the later stages. So that, a dynamic effective cost 
estimation technique is required for estimating the actual size of the modern software. 


Edilson J. D. Candido et al., [18] states, the key factor for determining the time and effort 
needed for developing a software project is software size. 


Linda M. Laird [19] states, software size is the base factor for deriving project effort, schedule 
and cost. Staffing, scheduling and cost is derived from effort. 


Daniel V. Fernes [22] comments, various methods and models are available for finding the 
software size. However, the accurate size estimation of software product is uncertain. 


Iman Attar Zadeh et al., [25] expecting a new dynamic software size estimation technique for 
calculating the software size at the early stages of SDLC, because it is very difficult with 
existing estimation techniques. 


The significant models like COCOMO [37]. SLIM [38], Use Case Points, Function Points [39], 
and SEER-SEM [40] are using software size as the main factor for finding cost, effort and time 
for the software development process. Expert Judgement [41] and Delphi Technique [42] are 
using the experiences and communications of an expert team. 


The following Table 3 gives features and levels of influence in each feature with sizing 
techniques. The influence level of each feature is from 0 to 3 (O-Nil, 1- Partial, 3- Full). 


Table 3. Features and Influences 


- Features LoC FP Fe. P UCP OP IP 
1__| Input and Output 0 2 2 2 1 0 
2 _| Level of Logical Files 1 2 2 2 2 1 
3__| Interface Files 0 Z 2 1 1 2 
4 _| Web Pages 0 1 1 1 1 2 
5 | GUI 0 2 1 2 2 1 
6 | Multimedia 0 0 0 0 0 1 
7__| Images 0 0 0 0 0 1 
8 | Documents and Text 1 0 0 0 0 2 
9 Accuracy in sizing with 0 0 0 0 0 0 

modern software 

10 Quality in sizing with 0 0 0 0 0 0 

modern software 

11 _| Reusability 0 1 0 0 2 0 
12 | Database support 0 0 i 0 0 0 
13 | Data Communication 0 if 0 0 0 0 
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3. Common Findings from The Analysis 


From the analysis reports of various techniques and thoughts transparently saying software 
size is the main factor for determining cost, time and effort for software development process. 
The concluded findings are: 


a. 


The productivity is assessed based on software size. 

Software size is an inevitable factor in software development process. 

The existing sizing techniques are particularistic and giving distinct results with 
distinct software. 

Software size as a basic factor for scheduling, risk analysis and project planning. 

If the software development methodology is modified then the sizing technique also be 
changed. 

A single sizing technique is not sufficient for finding the size of modern software 
products. 

The existing sizing techniques are environment centric. 

Existing sizing techniques are not opted for modern software system. 


4. The Limitations of Function Point Analysis Methods with Artificial 


Intelligence and Data Analytics Applications 


Based on the studies with the existing FPA methods and International Function Point 


Users Group’s (IFPUG) FPA method, the following drawbacks are identified. They are, 


1. The choice-based selection (e.g., if structure, Case structure) does not get extra size. 
[ 53-55] 

2. Relocation of User Interface (UJ) essentials without adding/removing/modifying 
some of them is not encompassed in the sizing method. [ 53-55] 

3. If the similar result is generated in several presentations or models, no extra size is 
considered for the several models. [ 53-55] 

4. The database or text files does not present in the FP count. [ 53-55] 

5. The trial versions and model versions of the software does not present in the FP 
count. It is reducing the effort of the developer. [ 53-55] 

6. Internal Inputs (II), Indexed and List data is not getting importance in the FP count. 
[ 53-55] 

7. The System Development Life Cycle (SDLC), Standardisations, Social, Economic 
and Topographical features, distinct topologies, distinct programming languages, 
and drivers are not into consider the FP count. [ 53-55] 

8. The Trivial Function Points (TFP) like Multi Valued FP, Dependent FP and 
Composite FP are not present in the FP calculations. [53-55] 
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9. The cost for estimating FP is high (estimation cost per function point is 4$ to 8$). 


So, the very large-scale projects are not estimated using FP method. [53-55] 


5. Conclusions 


of 


The AI and DA software applications are very large-scale projects with some millions 
lines of code. This software is the amalgamation of distinct of Operating Systems with 


different modules, different types of programming languages and application software 
depends on the module, distinct utility software and drivers, versatile environments, 
different types of network topologies and security measures, different file formats and 
application interfaces, various algorithms and laws, different SDLCs for applications, 
social, cultural and political factors. The existing sizing techniques are not opted for this 
complex software system. So that, the industry required a highly effective but simple sizing 
technique for finding the size of AI and DA software at the initial phases of SDLC. 
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